Monte Carlo approximation in incomplete information, sequential auction games
نویسندگان
چکیده
We model sequential, possibly multi-unit, sealed-bid auctions as a sequential game with imperfect and incomplete information. We develop an agent that constructs a bidding policy by sampling the valuation space of its opponents, solving the resulting complete information game, and aggregating the samples into a policy. The constructed policy takes advantage of information learned in the early stages of the game, and is flexible with respect to assumptions about the other bidders’ valuations. Because the straightforward expansion of the incomplete information game is intractable, we develop a more concise representation that takes advantage of the sequential auctions’ natural structure. We examine the performance of our agent versus agents that play perfectly, agents that also create policies using Monte Carlo, and other benchmarks. The technique performs quite well in these empirical studies, though the tractability of the problem is bounded by the ability to solve component games.
منابع مشابه
Flexible Decision-Making in Sequential Auctions
CAI, GANGSHU. Flexible Decision-Making in Sequential Auctions. (Under the direction of Assistant Professor Peter R. Wurman). Because sequential auctions have permeated society more than ever, it is desirable for participants to have the optimal strategies beforehand. However, finding closed-form solutions to various sequential auction games is challenging. Current literature provides some answe...
متن کاملMarkov Chain Monte Carlo for Incomplete Information Discrete Games
This paper outlines a Bayesian approach to estimating discrete games of incomplete information. The MCMC routine proposed features two changes to the traditional Metropolis-Hastings algorithm to facilitate the estimation of games. First, we propose a new approach to sample equilibrium probabilities using a probabilistic equilibrium selection rule that allows for the evaluation of the parameter ...
متن کاملMonte Carlo Sampling for Regret Minimization in Extensive Games
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an extensive game. One efficient method for computing Nash equilibria in large, zero-sum, imperfect information games is counterfactual regret minimization (CFR). In the domain of poker, CFR has proven effective, particularly when using a domain-specific augmentation involving chance outcome samplin...
متن کاملStochastic Constraint Programming for General Game Playing with Imperfect Information
The game description language with incomplete information (GDL-II) is expressive enough to capture partially observable stochastic multi-agent games. Unfortunately, such expressiveness does not come without a price: the problem of finding a winning strategy is NEXPNP-hard, a complexity class which is far beyond the reach of modern constraint solvers. In this paper, we identify a PSPACE-complete...
متن کاملInference on Auctions with Weak Assumptions on Information
Given a sample of bids from independent auctions, this paper examines the question of inference on auction fundamentals (e.g. valuation distributions, welfare measures) under weak assumptions on information structure. The question is important as it allows us to learn about the valuation distribution in a robust way, i.e., without assuming that a particular information structure holds across ob...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Decision Support Systems
دوره 39 شماره
صفحات -
تاریخ انتشار 2005